Computer-based methods for the mouse full-length cDNA encyclopedia: real-time sequence clustering for construction of a nonredundant cDNA library.

نویسندگان

  • H Konno
  • Y Fukunishi
  • K Shibata
  • M Itoh
  • P Carninci
  • Y Sugahara
  • Y Hayashizaki
چکیده

We developed computer-based methods for constructing a nonredundant mouse full-length cDNA library. Our cDNA library construction process comprises assessment of library quality, sequencing the 3' ends of inserts and clustering, and completing a re-array to generate a nonredundant library from a redundant one. After the cDNA libraries are generated, we sequence the 5' ends of the inserts to check the quality of the library; then we determine the sequencing priority of each library. Selected libraries undergo large-scale sequencing of the 3' ends of the inserts and clustering of the tag sequences. After clustering, the nonredundant library is constructed from the original libraries, which have redundant clones. All libraries, plates, clones, sequences, and clusters are uniquely identified, and all information is saved in the database according to this identifier. At press time, our system has been in place for the past two years; we have clustered 939,725 3' end sequences into 127,385 groups from 227 cDNA libraries/sublibraries (see http://genome.gse.riken.go.jp/).

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Construction and Application of Mouse Full-length cDNA Database

Our group has been making effort to collect mouse full-length cDNA clones as large scale Mouse cDNA project. It is aiming at collecting all kind of expressed full-length Mouse cDNA clones. The first phase of the project is to make cataloged non-redundant cDNA clone bank. It is necessary for analysis of sequence data and further applications of cDNA clones. We sequence 3’-end part of all cDNA cl...

متن کامل

Efficient filtering methods for clustering cDNAs with spliced sequence alignment

MOTIVATION Clustering sequences of a full-length cDNA library into alternative splice form candidates is a very important problem. RESULTS We developed a new efficient algorithm to cluster sequences of a full-length cDNA library into alternative splice form candidates. Current clustering algorithms for cDNAs tend to produce too many clusters containing incorrect splice form candidates. Our al...

متن کامل

Assessment of Redundancy and Full-Length Rate of Full-Length Enriched cDNA Libraries

Collection of full-length genes requires libraries with full-length cDNA insert, large-scale sequencing, library assessment, and high-speed sequence clustering. Here we focus on computational methods, such as newly developed computer programs, since our experimental methods had been published previously. Our purpose is the collection of full-length cDNAs, therefore the proportion of full-length...

متن کامل

FANTOM DB: database of Functional Annotation of RIKEN Mouse cDNA Clones

FANTOM DB, the database of Functional Annotation of RIKEN Mouse cDNA Clones, is designed to store sequence information of RIKEN full-length enriched mouse cDNA clones, graphical views of sequence analysis results, curated functional annotation information and additional descriptions, including Gene Ontology terms. RIKEN's Mouse Gene Encyclopedia Project aims to collect full-length enriched cDNA...

متن کامل

Molecular cloning of adenylate kinase from the human filarial parasite Onchocerca volvulus

Adenylate kinases (ADK) are ubiquitous enzymes that contribute to the homeostasis of adeninenucleotides in living cells. In this study, the cloning of a cDNA encoding an adenylate kinase from the filariaOnchocerca volvulus has been described. Using PCR technique, a 281 bp cDNA fragment encoding part ofan adenylate kinase was isolated from an O. volvulus cDNA library. Use of this fragment as a p...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Genome research

دوره 11 2  شماره 

صفحات  -

تاریخ انتشار 2001